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Abstract 

We consider the problem of selecting a portfolio of assets that 
provides the investor a suitable balance of expected return and risk. 
With respect to the seminal mean-variance model of Markowitz, we 
consider additional constraints on the cardinality of the portfolio and 
on the quantity of individual shares. Such constraints better capture 
the real-world trading system, but make the problem more difficult to 
be solved with exact methods. 

We explore the use of local search techniques, mainly tabu search, 
for the portfolio selection problem. We compare and combine previous 
work on portfolio selection that makes use of the local search approach 
and we propose new algorithms that combine different neighborhood 
relations. In addition, we show how the use of randomization and of 
a simple form of adaptiveness simplifies the setting of a large number 
of critical parameters. Finally, we show how our techniques perform 
on public benchmarks. 
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1 Introduction 



The portfolio selection problem consists in selecting a portfolio of assets (or 
securities) that provides the investor a given expected return and minimizes 
the risk. One of the main contributions on this problem is the seminal work 
by |Markowi~tzl (|1952 ), who introduced the so-called mean-variance model, 
which takes the variance of the portfolio as the measure of risk. Accord- 
ing to Markowitz, the portfolio selection problem can be formulated as an 
optimization problem over real-valued variables with a quadratic objective 



sec 



Markowitz, 1959| ] for an introductory 



function and linear constraints 
presentation). 

The basic Markowitz' model has been modified in the recent literature 
in various directions. First, |Konno and Yamazaki| ( |1991| ) propose a linear 
versions of the objective function, so as to make the problem easier to be 
solved using available software tools, such as the simplex method. On the 
other hand, with the aim of better capturing the intuitive notion of risk, 
[Konno and Suzuki| (|1995| ) and Markowitz et al. ( 1993 ) studied more complex 
objective functions, based on the notions of skewness and semi-variance, 
respectively. Furthermore, several new constraints have been proposed, in 
order to make the basic formulation more adherent to the real world trading 
mechanisms. 

Among others, there are constraints on the maximal cardinality of the 
portfolio [ jChang et al., 20001 pienstock, 1996 ] and on the minimum size of 
trading lots [ Mansini and Speranza, T999| . Finally, [Yoshimotcj ( jjjjgfj) and 
plover et al\ ( |1996| ) consider multi-period portfolio evolution with transac- 
tion costs. 

In this paper we consider the basic objective function introduced by 
Markowitz, and we take into account two important additional constraints, 
namely the cardinality constraint and the quantity constraint, which limit 
the number of assets and the minimal and maximal shares of each individual 
asset in the portfolio, respectively. 



The use of local search techniques for the portfolio selection problem has 
been proposed by |llolland| (|1997[) and Chang et al. (|200 



In this paper, 



we depart from the above two works, and we try to improve their techniques 
in various ways. First, we propose a broader set of possible neighborhood 
relations and search techniques. Second, we provide a deeper analysis on 
the effects of the parameter settings and employ adaptive evolution schemes 
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for the parameters. Finally, we show how the interleaving of different neigh- 
borhood relations and different search techniques can improve the overall 
performances. 

We test our techniques on the benchmarks proposed by |Chang et al.\ , 
which come from real stock markets. 

The paper is organized as follows. Section [| introduces the portfolio 
selection problem and its variants. Section |3] recalls the basic concepts of 
local search. Section f| illustrates our application of local search techniques 
to the portfolio selection problem. Section || show our experimental results. 
Section |6| discusses related work. Finally, Section |7| proposes future work and 
draws some conclusions. 



2 Portfolio Selection Problems 



We introduce the portfolio select problem in stages. In Section |2J], we present 



the basic unconstrained version of Markowitz. In Section 12.21, we introduce 



the specific constraints of our formulation. Other constraints considered in 



the literature but not in this work are mentioned in Section 2.3 



2.1 Basic formulation 

Given is a set of n assets, A = {ai, . . . ,a n }. Each asset a« has associated 
a real-valued expected return (per period) r*, and each pair of assets (aj, 
has a real-valued covariance a^. The matrix a nxn is symmetric and each 
diagonal element an represents the variance of asset a«. A positive value R 
represents the desired expected return. 

A portfolio is a set of real values X = {xi, . . . ,x n } such that each Xj 
represents the fraction invested in the asset a^. The value Y%=i Sj=i &ij x i x j 
represents the variance of the portfolio, and it is considered as the mea- 
sure of the risk associated with the portfolio. Consequently, the problem 
is to minimize the overall variance, still ensuring the expected return R. 
The formulation of the basic (unconstrained) problem is thus the following 
Markowitz, TTO . 



n n 

ni'ii Y VijXiXj 

i=l j=l 
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s.t. ^2nxi > R (l) 

n 

E^ = i (2) 

i=l 

< Xi < 1 (« = l,...,n) (3) 

This is a quadratic programming problem, and nowadays it can be solved 
optimally using available toolsQ despite the NP-completeness of the underly- 
ing decision problem. 

By solving the problem as a function of R, we obtain the so-called uncon- 
strained efficiency frontier (UEF), that gives for each expected return the 
minimum associated risk. The UEF for one of the benchmark problems of 
|Chang et al\ ( |2000|) is provided in Figure [l] (solid line). 




1 For example, an online portfolio selection solver is available at pttp : //www-f p .mcs . 
anl . gov/otc/Guide/CaseStudies/port/ 
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2.2 Additional constraints 

In our formulation, we consider the following two additional constraint types: 

Cardinality constraint: The number of assets that compose the portfolio 
is limited. That is, a value k < n is given such that the number of z's 
for which X{ > is at most k. 

Quantity constraints: The quantity of each asset i that is included in the 
portfolio is limited within a given interval. Specifically, a minimum 
and a maximum 5t for each asset % are given, and we impose that either 

Xt = or e, < Xi < 

These two constraint types can be modeled by adding n binary variables 
z±, . . . , z n and the following constraints. 



i><* (4) 

tiZi <Xi< SiZi (i = l,...,n) (5) 
Zi G {0, 1} (6) 

The variable Zi equals to 1 if asset dj is included in the portfolio, z\ = 
otherwise. The resulting problem is now a mixed integer programming 
problem, and is it much harder to be solved using conventional techniques. 

We call CEF the analogous of the UEF for the constrained problem. 
Given that we do not solve the problem with an exact method, we do not 
actually compute the CEF, but what we call the ACEF (approximate con- 
strained efficiency frontier) . Figure |l| shows the ACEF (dashed line) we com- 
puted for the same instance for the values £j = 0.01, 8i = 1 (for % = 1, . . . , n), 
and k = 10. 

Notice that when the return is high, the distance to UEF is very small 
because typically large quantities of a few assets are used, and thus Con- 
straints (HI) and (H) don't come into play. 

In particular, in the instance of Figure [I] the number of assets in the 
ACEF is below 10 for R > 0.0068. 
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2.3 Variants of the problem 

In this section, we briefly discuss other variants of the portfolio selection 
problem and different constraint types not taken into account in this work. 



Itansaction lots: Mansini and Speranza ( 199!J| ) consider the constraint 



stating that assets can be traded only in indivisible lots of fixed size. 
In this case, the problem must be formulated in terms of integer-valued 
variables — as opposed to real- valued ones — that represent, for each 
asset, the number of purchased lots, instead of the real- valued ones. 

Given that assets are normally composed by units, this constraint is 
certainly meaningful; its practical importance however depends on the 
ratio between the size of the minimum trading lots and the size of the 
shares involved in the portfolio. 



Linear risk: Konno and Yamazaki (|1991|) propose to replace the objective 



function based on the variance with a linear risk function based on 
absolute deviation. This simplified model is easier to deal with due 
to the absence of the quadratic term. According to the authors, it 
provided results as accurate as in the basic model. The effectiveness 
of the linear objective function, though, has been criticized by [Simaan 

(TO) - 



Semi-variance: Markowitz et at. ( |1993 ) propose a different version of the 



objective function based on semi-variance rather than on variance. 
Semi-variance is a concept similar to variance, except that it takes 
into account only downward deviations rather than upward and down- 
ward ones. In fact, as Markowitz (|1959| , Cap. 9) first noted, an objective 



function based on variance has the drawback that it considers very high 
and very low returns equally undesirable. The use of semi-variance in- 
stead tends to eliminate very low returns, without worrying about the 
distribution of positive returns. 

Skewness: A more complex objective function is considered by Konno and 
coworkers [ [Konno et al., 1993| ; |Konno and Suzuki, 1995j , who introduce 



a third-order quantity called skewness. Skewness is represented by 
means ofannxnxn matrix, and contributes to the objective function 
with a cubic term. The authors also propose an algorithm to solve the 
resulting problem. 
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Multi-period optimization: Most of the literature refers to a one-period 
trade in which the investor has initially no asset shares. In the perspec- 
tive of a longer run, however the investor at each round has to decide 
in which assets to invest, based also on the assets that he/she already 
holds. [Yoshimoto] (|1996 ) considers the case of the presence of an initial 
portfolio and the notion of transaction costs. The transaction costs 
must be subtracted from the expected return, and thus the number of 
atomic sell or buy operations must be taken into account. 



3 Local Search 

Local search is a family of general-purpose techniques for search and opti- 
mization problems. Local search techniques are non- exhaustive in the sense 
that they do not guarantee to find a feasible (or optimal) solution, but they 
search non-systematically until a specific stop criterion is satisfied. 

3.1 Local Search Basics 

Given an instance p of a problem P, we associate a search space S with it. 
Each element s e S corresponds to a potential solution of p, and is called 
a state of p. An element s 6 S that corresponds to a solution that satisfies 
all constraints of p is called a feasible state of p. Local search relies on a 
function N, depending on the structure of P, which assigns to each s G S its 
neighborhood N(s) C S. Each s' G N(s) is called a neighbor of s. 

A local search algorithm starts from an initial state sq, which can be 
obtained with some other technique or generated randomly, and enters a 
loop that navigates the search space, stepping from one state Sj to one of its 
neighbors Si + \. The neighborhood is usually composed by the states that are 
obtained by some local change (called move) from the current one. 

We call s fi na i the state reached after the stop condition has been met. 
We write s' = s <8> rn to mean that s' is obtained from s applying the move 
m. 

Local search techniques differ from each other according to the strategy 
they use both to select the move in each state and to stop the search. In 
all techniques, the search is driven by a cost function f that estimates the 
quality of the state. 
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For search problems, the cost function generally accounts for the number 
of violated constraints, and thus the search is guided by the aim of minimizing 
/ and reaching the value for it. For optimization problems, / also takes 
into account the objective function of the problem. We call Sb es t the first 
state such that f(sbest) < f(s) for all s encountered in the search started 
in Sq- Depending on the search technique, Sb est may or may not coincide 
necessarily with Sf ina i. 

In conclusions, local search consists in defining a search space and a neigh- 
borhood relation, creating an initial state so, and moving from so to Sfi na i 
according to the chosen strategy. 

The most common local search techniques are hill climbing (HC), simu- 
lated annealing (SA), and tabu search (TS). We describe in more details TS 
which is the technique that gave best results for our application. 



3.2 Tabu Search 

Tabu search is a modern local search technique that has been successfully 
applied in many real-life problems. A full description of TS is out of the 
scope of this paper (see, e.g., |Glover and Laguna] , |1997| ). We now describe 



the formulation of the technique which has been used in this work. 

At each state Sj, TS explores exhaustively the current neighborhood 
N(si). Among the elements in iV(sj), the one that gives the minimum value 
of the cost function becomes the new current state s i+ i, independently of the 
fact whether /(s i+ i) is less or greater than /(sj). 

Such a choice allows the algorithm to escape from local minima, but 
creates the risk of cycling among a set of states. In order to prevent cycling, 
the so-called tabu list is used, which determines the forbidden moves. This 
list stores the most recently accepted moves. The inverses of the moves in 
the list are forbidden. 

The simplest way to run the tabu list is as a queue of fixed size k. That 
is, when a new move is added to the list, the oldest one is discarded. We 
employ a more general mechanism which assigns to each move that enters 
the list a random number of moves, between two values k min and k max (where 
k m i n and k max are parameters of the method), that it should be kept in the 
tabu list. When its tabu period is expired, a move is removed from the list. 
This way the size on the list is not fixed, but varies dynamically between 
kmin and k max . 

There is also a mechanism that overrides the tabu status: If a move m 
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gives a "large" improvement of the cost function, then its tabu status is 
dropped and the resulting state is acceptable as the new current one. More 
precisely, we define an aspiration function A that, for each value v of the 
objective function, returns another value v' for it, which represents the value 
that the algorithm "aspires" to reach from v. Given a current state Sj, the 
cost function /, and a neighbor state s' obtained through the move m, if 
f(s') < A(f(si)) then s' can be accepted as s i+ i, even if m is a tabu move. 

The stop criterion is based on the so-called idle iterations: the search 
terminates when it reaches a given number of iterations elapsed from the 
last improvement of the current best state. 



3.3 Composite local search 

One of the attractive properties of the local search framework is that dif- 
ferent techniques can be combined and alternated to give rise to complex 
algorithms. 

In particular, we explore what we call the token-ring strategy, which is 
a simple mechanism for combining different local search techniques and/or 
different neighborhood relations. Given an initial state sq and a set of basic 
local search techniques t\, . . . ,t q , that we call runners, the token-ring search 
makes circularly a run of each tj, always starting from the best solution found 
by the previous runner (or t q if i = 1). 

The full token-ring run stops when it performs a fixed number of rounds 
without an improvement by any of the solvers, whereas the component run- 
ners ti stop according to their specific criteria. 

The effectiveness of token-ring search for 2 runners has been stressed by 
several authors (see |Glover and Laguna| , |1997| ). In particular, when one of the 



two runners, say t 2 , is not used with the aim of improving the cost function, 
but rather for diversifying the search region, this idea falls under the name 
of iterated local search (see, e.g., |Sttitzle| , |1998| ). In this case the run with t 2 



is normally called the mutation operator or the kick move. 

For example, we used the alternation of a TS using a small neighborhood 
with HC using a larger neighborhood for the high-school timetabling problem 
|Schaerf, 1999] . 
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4 Portfolio Selection by Local Search 



In order to apply local search techniques to portfolio selection we need to 
define the search space, the neighborhood structures, the cost function, and 
the selection rule for the initial state. 

4.1 Search space and neighborhood relations 

For representing a state, we make use of two sequences L = {a^, . . . , %} 
and S = {xi x , . . . , Xi p } such that G A and x^ is the fraction of in 
the portfolio. All assets G" L have the fraction Xj implicitly set to 0. 
With respect to the mathematical formulation, having a« G L corresponds to 
setting Zi to 1. 

We enforce that the length p of the sequence L is such that p < k, that the 
sum of Xi t equals 1, and that < < 5^ for all elements in L. Therefore, 
all elements of the search space satisfy Constraints (0), (|3]), (|]), and (||). 
Constraint (|l|) instead is not always satisfied and it is included in the cost 
function as explained below. 

Given that the problem variables are continuous, the definition of the 
neighborhood relations refers to the notion of the step of a move m, which is 
a real- valued parameter q, with < q < 1, that determines the quantity of the 
move. Given a step q, we define the following three neighborhood relations: 
Other relations have been investigated, but did not provide valuable results. 

idR ([i]ncrease, [d]ecrease, [R]eplace): 

Description: The quantity of a chosen asset is increased or decreased. All 
other shares are changed accordingly so as to maintain the feasibility 
of the portfolio. If the share of the asset falls below the minimum it is 
replaced by a new one. 

Attributes: (a^, s, aj) with G A, s £ {f, [}, cij G A 

Preconditions: a« G L and a,j L 

Effects: If s = | then x% := x« • (1 + q), otherwise X{ := x% • (1 — q). All values 
Xk — £fc are renormalized so as to maintain the property that x^'s add 
up to 1. We renormalize x& — e& and not x^ to ensure that no asset 
rather then d{ can fall below the minimum. 
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Special cases: If s = J, and Xi(l — q) < 6$, then m is deleted from L and 
Oj is inserted with Xj = tj. If s = f and Xj(l + g) > <5j, then a;, is set 
to (J*. 

Reference: Revised version of |Chang etd\ (120001) . 



idID ([i]ncrease, [djecrease, [l]nsert, [DJelete): 

Description: Similar to id R, except that the deleted asset is not replaced 
and insertions of new assets are also considered. 

Attributes: (cti, s) with 04 E A, s € {], |, 

Preconditions: If s = J, or f then Oj € L. If s = <— > then a\ ^ L. 

Effects: If s = | then x« := X{ ■ (1 + q); if a = J. then x% := x% ■ (1 — g); 
if s = then aj : is inserted into L. The portfolio is repaired as 
explained above for id R. 

Special cases: If s = J, and Xi(l — q) < ei, then is deleted from L, and 
it is not replaced. If s = f and + q) > 5i, then Xi is set to 5j. 

TIP ([Transfer, [l]nsert, [D]elete): 

Description: A part of the share is transferred from one asset to another 
one. The transfer can go also toward an asset not in the portfolio, which 
is then inserted. If one asset falls below the minimum it is deleted. 

Attributes: (ai,aj) with cij € A, dj € A 

Preconditions: Oj £ L 

Effects: The share Xi of asset aj is decreased by q ■ Xi and Xj is increased by 
the same quantity. If a,j L than it is inserted in L with the quantity 
q ■ Xi. 

Special cases: The quantity transferred is larger than q-Xi in the following 
two cases: (i) If after the decrease of Xi we have that x^ < q then also 
the remaining part of Xi is transferred to aj. (ii) If cij L and q-Xi < €j 
then the quantity transferred is set to ej. 

Reference: Extended version of |Rolland| (£997]). 



Intuitively, id R and id ID increase (or decrease) the quantity of a single 
asset, whereas TID trasfers a given amount from one asset to another one. 

Notice that idR moves never change the number of assets in the portfolio, 
and thus the search space is not connected under idR. Therefore, the use of 
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id R for the solution of the problem is limited. The relation id ID in fact is a 
variant of id R that overcomes this drawback. 

Notice also that under all three relations the size of the neighborhood is 
not fixed, w.r.t. the size of L, but it depends on the state. In particular, it 
depends on the number of assets that would fall below the minimum in case 
of a move that reduces the quantity of that asset. For example, for id R, the 
size is linear, 2 • \L\, if no asset is such that 2^(1 — g) < e^, but becomes 
quadratic, \L\ + \L\ • (n — \L\), if all assets are in such conditions. 

We now define the inverse relations, which determines which moves are 
tabu. Our definitions are the following: For idR and id ID, the inverse of m 
is any move with the same first asset and different arrow. For TID, it is the 
move with the two assets exchanged. 



4.2 Cost function and initial state 

Recalling that all constraints but Constraint (|I|) are automatically satisfied 
by all elements of the search space, the cost function f(X) is composed by the 
objective function and the degree of violation of Constraint ([!]). Specifically, 
we define two components, fi(X) = max (0, J2i=i r i x i ~ an d fi{X) = 
J2?=i Sj=i &ij x i x ji which take into account the constraint and the objective 
function, respectively. The overall cost function is a linear combination of 
them: /(X) = w 1 f 1 (X) + w 2 f 2 (X). 

In order to ensure that a feasible solution is found, W\ is (initially) set to 
a much larger value than W2- However, during the search, w\ is let to vary 



according to the so-called shifting penalty mechanism (see, e.g., |Gendreau el 
\al\ , 1994 ): If for K consecutive iterations Constraint (|l|) is satisfied, Wi is 
divided by a factor 7 randomly chosen between 1.5 and 2. Conversely, if it is 
violated for H consecutive iterations, the corresponding weight is multiplied 
by a random factor in the same range (where H and K are parameters of 
the algorithm). 

Notice that given a state s and a move m the evaluation of the cost change 
associated to m, i.e. f(s ®m) — f(s) is computationally quite expensive for 
both idR and id ID, due to the fact that m changes the fraction of all assets 
in L. The computation of the cost is instead much cheaper for TID. 

The initial state is selected as the best among / = 100 random portfolios 
with k assets. However, experiments show that the results are insensitive to 
I. 
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4.3 Local search techniques 

We implemented all the three basic techniques, namely HC, SA, and TS, for 
all neighborhood relations. HC, which performs only improving and sideways 
moves, is implemented both using a random move selection and searching for 
the best move at each iteration (steepest descent). SA, which for the sake 
of brevity is not described in this paper, is implemented in the "standard" 
way described in [ |Johnson et al, 1989( 1. TS is implemented as described 



in Section |3.2| using a tabu list of variable size and the shifting penalty 
mechanism. 

We also implemented several token-ring procedures. The main idea is to 
use one technique t%, with a large step q, in conjunction with another one t 2 , 
with a smaller step. The technique t\ guarantees diversification, whereas t 2 
provides a "finer-grain" intensification. 

The step q is not kept fixed for the entire run, but it is allowed to vary 
according to a random distribution. Specifically, we introduce a further pa- 
rameter d and for each iteration the step is selected with equal distribution 
in the interval q — d and q + d. 

Due to its limited exploration capabilities, id R is used only for t%. Other 
combinations, of two or three techniques, have also been tested as described 
in the experimental results. 

4.4 Benchmarks and experimental setting 

We experiment our techniques on 5 instances taken from real stock markets.^] 
We solve each instance for 100 equally distributed values for the expected 
return R. 

We set the constraint parameters exactly as |Chang et al\ ( |2000| ): q = 0.1 



and Si = 1 for i = 1, . . . ,n, and k = 10 for all instances. 

Given that the constraint problem has never been solved exactly, we can- 
not provide an absolute evaluation of our results. We measure the quality of 
our solutions in average percentage loss w.r.t. the UEF (available from the 
web site). We also refer to the ACEF, which we obtain by getting, for each 
point, the best solution found by the set of all runs using all techniques. The 
ACEF has been computed using a very large set of long runs, and reason- 
ably provides a good approximation of the optimal solution of the constrained 
problem. 



2 Available at the URL http://mscmga.ms.ai.ac.uk/jeb/orlib/portfolio.html 
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Table [I| illustrates, for all instances, the original market, the average 
variance of UEF (multiplied by xlO 3 for convenience), and the percentage 
average of the difference between ACEF and UEF. 



No. 


Origin 


assets 


UEF 


% Diff 


1 


Hong Kong 


31 


1.55936 


0.00344745 


2 


Germany 


85 


0.412213 


2.53845 


3 


UK 


89 


0.454259 


1.92711 


4 


USA 


98 


0.502038 


4.69426 


5 


Japan 


225 


0.458285 


0.204786 



Table 1: The benchmark instances 

Notice that the problem for which the discrepancy between UEF and 
ACEF is highest is no. 4 (with 4.69%). For this reason we illustrate our 
results for no. 4, in which the differences are more tangible. 

Except for no. 1, all other instances give qualitatively similar results and 
they require almost the same parameter settings. Instance no. 1 instead, 
whose size is considerably smaller than the others, shows peculiar behaviors 
and requires completely different settings. Specifically, it requires shorter 
tabu list and much smaller steps. 



5 Experimental Results 

The code is written in C++ with the compiler gcc (version 2.96), and makes 
use of the local search library EasyLocalH — \- |Di Caspero and Schaerf 



pOOOj ], It runs on a 300MHz Pentium II using Linux. 

In the following experiments, we run 4 trials for every point. For each 
parameter setting, we therefore run 2000 trials (4 trials x 100 points x 5 
instances). Except for the first point of the UEF, in one of the four trials 
the initial state is not random, but it is the final state of the previous solved 
point of the UEF. The number of iterations is chosen in such a way that each 
single trial takes approximately 2 seconds (on a 300MHz Pentium II, using the 
C++ compiler egcs-2.91.66), and therefore each test runs for approximately 
an hour. 

We experimented with 20 different values of the step q. Regarding the step 
variability d, preliminary experiments show that the best value is q, which 
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means the step varies between and 2q. In all the following experiments, d 
is either set to (fixed step) or is set to q (random step). 

Regarding the parameters related to the shifting penalty mechanism, the 
experiments show that the performances are quite insensitive to their varia- 
tions as far as they are in a given interval. Therefore, we set such parameters 
to fixed values throughout all our experiments (H — 1, K — 20). 

5.1 Single solvers 

The first set of experiments regards a comparison of algorithms using the 
three neighborhood relations id ID, id R, and TID in isolation. Given that the 
search space is not connected under id R, the relation idR is run for initial 
states of all sizes from 2 to 10 (it is therefore granted a much longer running 
time). For the other two, idID and TID, we start always with an initial state 
of 10 assets. 

Table |2] shows the best results for TS for both fixed and random steps, 
and the corresponding step values. For TS, the tabu list length is 10-25, and 
the maximum number of idle iterations is set to 1000. 

The table shows also the best performance of HC and SA for TID and 
id ID. For the sake of fairness, we must say that the parameter setting of SA 
has not been investigated enough. 

The results in Table |2| show that TS works much better than the others, 
and TID works better than idR and id ID. They also show that the random- 
ization of the step improves the results significantly. 



Tech. 


Nhb 


Fixed Step 
step % Diff 


Random Step 
base step % Diff 


TS 


idID 


0.5 


6.31568 


0.4 


5.60209 


TS 


TID 


0.5 


5.42842 


0.3 


4.85423 


TS 


idR 


0.4 


5.4743 


0.4 


5.4621 


SA 


TID 


0.4 


53.7006 


0.4 


56.5798 


SA 


idID 


0.2 


118.698 


0.5 


113.735 


HC 


TID 


0.2 


29.2577 


0.2 


29.039 


HC 


idID 


0.2 


41.4734 


0.1 


41.0438 



Table 2: Comparison of simple solvers 



15 



5.2 Composite solvers 

Table |3] shows the best results for token-ring with various combinations of 
two or three neighborhoods all using TS and random steps. Notice that we 
consider as token-ring solver also the interleaving of the same technique with 
different steps. 



Runner 1 


Runner 2 


Runner 3 




Nbh 


Step 


Nbh 


Step 


Nbh 


Step 


% Diff 


TID 


0.4 


TID 


0.05 






4.70872 


TID 


0.4 


TID 


0.04 


TID 


0.004 


4.70866 


TID 


0.4 


idR 


0.05 






4.70804 


TID 


0.4 


idR 


0.05 


TID 


0.01 


4.71221 


idID 


0.4 


idID 


0.04 






5.06909 


idID 


0.3 


idID 


0.03 


idID 


0.003 


4.99406 


idID 


0.4 


idR 


0.05 






4.99206 


idID 


0.4 


idR 


0.04 


idID 


0.004 


5.16368 



Table 3: Comparison of composite solvers 

The table shows that the best results are obtained using the combination 
of TID and idR, but TID with different steps performs almost as good. This 
results are very close to the ACEF (4.69426%), which is obtained using also 
much longer runs (24 hours each). 

In conclusion, the best results (around 4.7%) are obtained by token ring 
solvers with random steps. Further experiments show that the most critical 
parameter is the size of the step of t\, which must be in the range [0.3, 0.6]. 
They also show that using alternation of fixed steps the best result obtained 
is 4.84883. 

5.3 Effects of constraints on the results 

We conclude with a set of experiments that highlights the role played by 
constraints ^ and [5] on the problem. Figure ^] shows the best results for 
instance no. 4 for different values of the maximum number of assets k (ej and 
Si are fixed to the values 0.01 and 1). 

The results show that the effect of the constraint decreases quite steeply 
when increasing k. The effect is negligible for k > 30. 
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Figure 2: Results for different values of the maximum cardinality (k) 



Figure 0, instead, shows how the quality of the portfolio decreases while 
increasing the minimum quantity (ej). In order to focus on the minimum 
quantity constraint, we use a high value for the maximum cardinality (k = 
20) so as to make the effect of the corresponding cardinality constraint less 
visible. 

We don't show the results for different values of Si because the constraint 
on maximum quantity is less important from the practical point of view. 



6 Related Work 



This problem has been previously considered by Chans et al\ ( |2000|) , who 



implemented three solvers based on TS, SA, and genetic algorithms (GA). 
Their experimental results suggest that GA and SA work better than TS. 
Even though the TS procedure is not completely explained in the paper, we 
believe that this "defeat" of TS in favor of SA and GA is due to the fact that 
their version of TS is not sufficiently optimized. 



The neighborhood relation used by |Chang et al\ is a variant of id R. The 
difference stems from the fact that in their case a move m is represented 
by only the pair (aj,s) and the replacing asset cij is not considered part of 
m, but it is randomly generated whenever necessary. This definition makes 
incomplete the exploration of the full neighborhood because the quality of 
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Figure 3: Results for different values of the minimum quantity (e* 



a move (a*, J,) may depends of the randomly generated aj. In our work, 
instead, all possible replaces aj are analyzed. In addition, the application of 
a move m = (a,, |) is non- deterministic, and therefore it is not clear which is 
the definition of the inverse of m, and the definition of the tabu mechanism. 
Finally, with respect to our version, their TS misses the following important 
features: shifting penalty mechanism, random step (they use the fixed value 
0.1), and variable-size tabu list. 

Even though CEang et al\ solve the same problem instances, a fair com- 
parison between their and our results is not possible for two reasons: 

First, they formulate Constraint (f|) with the equality sign, i.e. J2?=i %i = 
k, rather than as an inequality. As the authors themselves admit, constrain- 
ing the solution to an exact number of assets in the portfolio is not meaningful 
by itself, but it is a tool to solve the inequality case. They claim that the so- 
lution of the problem with the inequality can be found solving their problem 
for all values from 1 to k. Unfortunately, though, they provide results only 
for the problem with equality. 

Second, they do not solve a different instance for each value of R, but (fol- 
lowing Perold| , |1984|) , they reformulate the problem without Constraint ([I]) 
and with the following objective function: f(X) = Xfi(X) + (1 — \)f 2 (X). 
The problem is then solved for different values of A. The quality of each solu- 
tion is measured not based on the risk difference w.r.t. the UEF for the same 
return R, but using a metric that takes into account the distance to both axis. 
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The disadvantage of this approach is that they obtain the solution for a set 
of values for R which are not an homogeneously distributed. Therefore their 
quality cannot be measured objectively, but it depends on how much they 
cluster toward the region in which the influence of Constraints and (||) is 
less or more strong. In addition, these sets of points are not provided, and 
thus the results are not reproducible and not comparable. 

Rolland ( [L997|) considers the unconstrained problem therefore his results 
are not comparable. He introduces the TID neighborhood which turned out 
to be the most effective. Although, the definition of |Rolland| is different 
because he considers only transfers and no insertions and deletions. This 
is because, for the unconstrained problem, all assets can be present in the 
portfolio at any quantity, and therefore there is no need of inserting and 
deleting. The introduction of insert and delete moves is our way to adapt his 
(successful) idea to the constrained case. 

Rolland makes use of a tabu list of fixed length equal to 0.4 ■ n, thus 
linearly related to the number of assets. He alternates the fixed step value 
0.01 with the fixed value 0.001, shifting every 100 moves. Our experiments 
confirm the need for two (and no more than two) step values, but they show 
that those values are too small for the constrained case. In addition, for the 
constrained problem, randomization works better than alternating two fixed 
values. 



7 Conclusions and Future Work 

We compared and combined different neighborhood relations and local search 
strategies to solve a version of the portfolio selection problem which involves 
a mixed-integer quadratic problem. Rather than exploring all techniques in 
the same depth, we focussed on TS that turned out to be the most promising 
from the beginning. 

This work shows also how adaptive adjustments and randomization could 
help in reducing the burden of parameter setting. For example, the choice of 
the step parameter turned out to be particularly critical. 

We solved public benchmark problems, but unfortunately no comparison 
with other results is possible at this stage. 

In the future, we plan to adapt the current algorithms to different ver- 
sions of the portfolio selection problem, both discrete and continuous, and 
to related problems. Possible hybridization of local search with other search 
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paradigms, such as genetic algorithms, will also be investigated. 
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